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BACKGROUND OF THE INVENTION 



Field of the Invention 

The present invention relates to a method and apparatus for 
5 encoding moving pictures. In particular, the present invention relates to a method 
and apparatus for performing variable bit rate control in a digital video encoder 
while maintaining a particular overall bit-rate. 

BACKGROUND OF THE I NVENTION 

Description of the Related Art 

10 One of the main obstacles faced by industry dealing with digital video 

processing, storage and transmission is the large amount of data needed to 
represent analog video in the digital domain. Accordingly, digital compression is 
often applied to moving pictures to achieve a reduction in required transmission 
bandwidth or storage size. One variety of such compression techniques can be 

1 5 derived from the ISO/IEC MPEG Standards, the ISO/IEC 1 1 1 72-3(MPEG-1 ), the 
ISO/IEC 13818-2 (MPEG-2) and the MPEG-2 TM5 (test model 5), developed by 
the Moving Picture Experts Group of the International Organisation Organization for 
Standardization. The disclosures of those standards documents are hereby 
expressly incorporated into this specification by reference. MPEG-1 is the 

20 compression standard used in VideoCD while MPEG-2 is the video compression 
standard used in DVD and many digital broadcasting systems. 

The MPEG standards specify only the syntax of the compressed bit 
streams and method of decoding. The method of implementation in the encoder is 
left to the developer, and any form of encoder may be employed as long as the 

25 resulting bit stream conforms with the specified syntax. 



1 



V 



In certain applications such as video storage device (recorder), it is 
possible to use variable bit rate (VBR) encoding. A VBR encoder is able to vary its 
output bit-rate over a larger range than a CBR encoder, and this would generate 
an output which has a more constant visual quality. An example of a VBR encoder 
5 is described in United States patent number 5,650,860, entitled "Adaptive 
Quantization". In order to maintain a maximum bit rate allowed by the target 
storage device as well as an overall bit-rate which enables input picture sequence 
to be stored into a defined storage space, such VBR encoders utiliso utilize multiple 
encoding passes. 

10 In the first encoding pass, the bit util i sat i on utilization information is 

determined for each scene or each picture in the input sequence. This may be 
done by fixing the reference quant i sation quantization step size and disabling the 
VBV control. The determined bit ut i l i sation utilization information is then used to 
generate a bit budget for each scene or picture such that an overall target number 

1 5 of bits to code the sequence is fixed, and that the maximum bit rate is not violated. 
In cases that bit ut ili sat i on utilization information obtained is not close to that 
required for generating the bit budget, steps from the first coding pass must be 
repeated with an adjusted reference quantisat i on quantization step-size. The input 
sequence is coded in a final pass using the generated bit budget information to 

20 achieve the target bits or overall bit rate. 

Multiple-pass VBR encoders requires large storage memory for 
intermediate bit ut ili sation utilization information, and large computation needs for 
the additional passes and bit budget generation. Furthermore, such a VBR 
encoder cannot process input sequences in real-time as required by certain 

25 applications. 

BRIEF SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a single-pass real- 
time variable bit rate encoder for moving pictures. It is also an object of the 
present invention to provide variable bit rate encoding of moving pictures such that 
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the change in encoded picture quality from one scene to another is 
m i n i m i sod minimized . A further object is to provide a real-time variable bit rate 
encoding algorithm which produces a constant overall bit rate. 

In particular, the present invention encodes an input moving pictures 
5 sequence one segment at a time according to a target encoding quality which is 
determined by a target segment bit rate. The target segment bit rate of a current 
segment is preferably derived from the differences between the target segment bit 
rates and the actual coding bit rates of previous or previous few encoded 
segments. 

10 To maintain consistent encoding quality for all pictures within a 

segment, the actual target bit rate for encoding the pictures is made variable 
according to their scene complexities as well as the target encoding quality of the 
segment. 

As the target encoding quality of each segment is modified based on 
1 5 the differences between the target and the actual bit rates of previous or previous 
few encoded segments, the change of target encoding quality from segment to 
segment is made relatively smooth compared to that of a Constant Bit-Rate 
Encoder, and furthermore, the overall bit rate of encoding is maintained constant. 

In accordance with the present invention, there is provided a method 
20 for use in a moving pictures encoder for encoding a sequence of segments each 
having at least one image, comprising the steps of: 

a) determining an overall target bit rate for encoding the sequence of 

images; 

b) determining a bit allocation and target quant i Gat i on quantization 
25 step size for encoding a first segment on the basis of a segment target bit rate 

calculated using said overall target bit rate; 

c) encoding said first segment using a variable bit rate encoding 
method according to the target quant i sat i on quantization step size; 

d) determining a difference between the number of bits used to 
30 encode said first segment and said first segment bit allocation; 



e) distributing said difference for use in encoding at least one 
subsequent segment to determine a subsequent segment bit allocation; 

f) determining a new target quant i Gat i on quantization step size for 
encoding a said subsequent segment on the basis of a new target segment bit rate 

5 calculated using said segment target bit rate and the distributed difference; and 

g) encoding said subsequent segment using a variable bit rate 
encoding method according to the new target quant i sation quantization step size; 

wherein variable bit rate encoding is employed for encoding pictures 
within a segment whilst maintaining a substantially constant bit rate over said 
10 sequence. 

The present invention also provides a method for encoding moving 
pictures in a moving pictures encoder wherein a sequence of images are provided 
as input, the sequence of images comprising a plurality of segments each having a 
plurality of images, the method including: 
1 5 a) determining an overall target bit rate for encoding the sequence of 

images; 

b) maintaining a distribution record of bits from at least one 
previously encoded segment allocated for use in encoding at least one segment to 
be encoded; 

20 c) determining a target segment bit rate for a segment of the 

sequence of images on the basis of the overall target bit rate and a bit rate change 
calculated from the corresponding allocated bits from the distribution record; 

d) determining a target segment encoding quality from the target 
segment bit rate, a preceding target segment bit rate and a preceding target 

25 segment encoding quality,; and 

e) encoding the images of the segment according to the target 
segment encoding quality using a variable bit rate encoding technique taking into 
account scene complexities of the images in the segment; 

wherein maintaining said distribution record includes determining a 
30 difference between the number of bits used to encode a particular segment and 



the number of bits allocated for encoding the particular segment on the basis of the 
target segment encoding quality. 

According to the current invention, a moving pictures sequence is 
divided into segments. 
5 The size of each segment may be suitably determined. Each 

segment is encoded with a target encoding quality derived from its target segment 
bit rate. A variable bit rate (VBR) encoder is trtitise dutilized to encode the segment 
according to its target encoding quality. 

The target segment bit rate of an initial segment is obtained from a 
10 user defined target overall bit rate. After encoding the segment, the difference 
between the actual bit rate used and the target segment bit rate is obtained. This 
difference is propagated to the next or next few segments to be coded. This 
process is repeated for each segment; therefore, for each subsequent segment, a 
new target segment bit rate is determined from the user defined target overall bit 
15 rate and the differences between the target segment bit rates and actual bit rates 
of previous or previous few segments. 

The present invention further provides a method for controlling bit 
allocation in a moving pictures encoder for encoding a sequence of images 
comprising a plurality of segments each having a plurality of images, the method 
20 including, for each segment: 

determining a difference between a number of bits used for encoding 
a previous segment and a number of bits allocated for encoding the previous 
segment; 

calculating a bits distribution from the determined bits difference and 
25 a predetermined distribution function; 

calculating a bit rate change from the bits distribution and a 
predetermined number of images in the segment; 

calculating a target segment bit rate from the bit rate change and a 
predetermined target overall bit rate for the sequence of images; and 
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determining a target segment encoding quality from the target 
segment bit rate. 

The present invention further provides an encoding quality 
adjustment processor for generating a target segment encoding quality value in a 
5 moving pictures encoder for encoding a series of segments each having at least 
one image using a variable bit rate encoding scheme whilst maintaining a 
substantially constant overall bit rate, comprising: 

a bits difference computation means coupled to receive a segment 
encoding bit uti l isation utilization value and a target segment bit rate and generate 
10 therefrom a bits difference value representing a difference in bits allocated and bits 
used for encoding a segment; 

a bits distribution means coupled to the bits difference computation 
means for computing at least one bits distribution value from the bits difference 
value and a predetermined distribution function; 
15 a bit rate difference computation means coupled to the bits 

distribution means for computing a segment bit rate difference from the at least 
one bits distribution value and a predetermined number of images in a segment; 

a target segment bit rate adjustment means coupled to the bit rate 
difference computation means and the bits difference computation means for 
20 computing said target segment bit rate from the segment bit rate difference and a 
predetermined target overall bit rate for the sequence of segments; and 

an encoding quality computation means coupled to the target 
segment bit rate adjustment means for computing a target segment encoding 
quality value from said target segment bit rate. 
25 The present invention further provides a moving pictures encoder 

comprising: 

a coding processor for encoding picture data based on macroblocks 
according to a quant i sation quantization step size; 
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a virtual buffer processor coupled to the coding processor for tracking 
a number of bits used for encoding successive macroblocks in a picture and a 
number of bits used for encoding successive pictures in a group of pictures; 

a quant i Gation quantization step size processor coupled to the coding 
5 processor for determining said quant i sat i on quantization step size from a target 
number bits allocated for a picture and the number of bits already used for 
encoding macroblocks in that picture; 

a picture bit allocation processor coupled to the 
quantiGation quantization step size processor for determining said target number of 
1 0 bits allocated for a picture from a target bit rate and the number of bits already 
used for encoding pictures in a current group of pictures; 

a bit rate adjustment processor coupled to the picture bit allocation 
processor, the virtual buffer processor and the quant i sat i on quantization step size 
processor for determining said target bit rate from the number of bits already used 
1 5 for encoding successive pictures in the current group of pictures, a target encoding 
quant i Gation quantization step size and an average quantiGation quantization step 
size for pictures in the current group of pictures; and 

a target encoding quant i sation quantization step size processor 
coupled to the bit rate adjustment processor and the virtual buffer processor for 
20 determining said target encoding quant i Gat i on quantization step size from a 

predetermined target overall bit rate and the number of bits used for encoding a 
preceding group of pictures. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention is described in greater detail hereinafter, by way of 
25 example only, with reference to the accompanying drawings, wherein: 

Figure 1 is a block diagram of the constant bit-rate controller based 

on TM5; 
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Figure 2 illustrates the difference between a target bit allocation and 
an actual bits consumption at macroblock level for one frame (a P-picture is 
chosen for this case), where dj is the virtual buffer fullness at a particular instance; 
. Figure 3 is a block diagram of the variable bit-rate encoder with 
5 constant overall bit-rate control; 

Figure 4 is a block diagram of the constant overall bit-rate controller; 
Figure 5 illustrates how the difference in bit count for the last 
segment is redistributed over the next four segments (er-g ^e.g, f (m) = 1/4); and 
Figure 6 is a flow chart of a VBR algorithm with constant overall bit- 

10 rate control. 

DETAILED DESCRIPTION OF THE PREFERRED EM B OD I MENTS INVENTION 

In a standard MPEG compliant video encoder, a sequence of moving 
pictures ( o, g. e. g., video) is input to the encoder where it is compressed with a 
user defined target bitrate. The target bitrate is set according to the 

15 communication channel bandwidth in which the compressed video is to be 
transmitted, or the storage media capacity in which the compressed video 
sequence is to be stored. A typical MPEG encoder involves motion 
estimation/prediction, Inter/lntra classification, discrete cosine transform (DCT) 
computation, quantioation quantization . zig-zag scanning, variable length coding 

20 and rate-control. 

Several different forms of coding can be employed depending upon 
the character of the input pictures, referred to as l-pictures, P-pictures, or B- 
pictures. The l-pictures are intra-coded pictures used mainly for random access or 
scene update. The P-pictures use forward motion predictive coding with reference 

25 to previously coded l-or P-pictures (anchor pictures), and the B-pictures use both 
forward and backward motion predictive/interpolative coding with reference to 
previously coded -or P-pictures. Furthermore, a group of pictures(GOP) is formed 
in encoded order starting with an l-picture and ending with the picture before the 
next l-picture in the sequence. 



The pictures are partitioned into smaller and non-overlapping blocks 
of pixel data called macroblocks (MBs) before encoding. Each from a P-or B- 
picture is subjected to a motion estimation process in which forward motion 
vectors, and backward motion vectors in the case of a B-picture MB, are 
5 determined using reference pictures from a frame buffer. The target macroblock in 
the current picture is matched with a set of displaced macroblocks in the reference 
picture, the macroblock that best matches the target macroblock is used as the 
predicted macroblock. The position of this predicted macroblock is specified by a 
set of vectors known as motion vectors, which describe the vertical and horizontal 

10 displacement between the target and predicted macroblock. For a B-picture, the 
process is similar except there are two reference pictures: anchor pictures 
immediately preceding and following the B-picture. 

With the determined motion vectors, motion compensation is 
performed where the intra-or inter-picture prediction mode of the MB is first 

15 determined according to the accuracy of the motion vectors found, followed by 
generating the necessary predicted MB. l-pictures are always intra coded while for 
P and B-pictures, a decision on whether intra or inter coding will be used (at 
macroblock level) is made depending on which method will give rise to a more 
efficient coding. 

20 Transformation of the macroblock using a DCT is then carried out on 

the 8x8 pixel blocks within the macroblock. For intra coding, the actual picture 
data is coded, while for inter coding the prediction error is coded. This is followed 
by a quant i sation auantization process of the DCT coefficients which involves a 
quant i sat i on quantization matrix and a quantisation quantization step size. The 

25 quantized coefficients are then run-length encoded with variable length codes. 

The resultant bit usage and statistical data are passed onto a rate 
control module, which is used for allocating a target number of bits used to encode 
each picture and each macroblock within the picture. This is an important module 
in the encoder as it plays a major role in maintaining the quality of the encoded 

30 picture. 



The rate control in MPEG-2 Test Model 5 (TM5) comprises the steps 
of allocating the target amount of bits for coding each picture, deriving the 
reference quantisat i on quantization parameter (Q 7 ) to be used on each macroblock 
in a picture, and modulating the Q, based on the activity masking level of the 
5 surrounding blocks of the corresponding macroblock to obtain the modulated 
quantisation quantization step size (Mquant) used to quontisc auantize the 
macroblock. 

An objective of this rate controller is to ensure all pictures maintain a 
similar level of quality. It assumes that the subjective quality of a single coded 
10 picture can be qualified with a single number V, described by factor K/Q where K is 
a constant particular to a picture type and Q is the quantioation quantization step 
size for the picture being coded. That is, the rate controller will try to preserve the 
following equation: 

y _ K i ^ K p = K b 

1 5 The subscripts /, p, b refers to I, P, and B-picture types. The 

parameters K„ K p , and K b are experimentally determined constants for all I, P, and 
B pictures respectively, and K, usually is normalisod normalized to the value of 1 . 

Figure 1 illustrates a block diagram of a known embodiment of the 
TM5 rate controller. A group-of-pictures (GOP) is a collection of one I picture, 

20 some P pictures and B pictures, and serves as a basic access unit with the I 
picture as the entry point to facilitate random access. The controller comprises 
three levels of processing: a GOP level 1 1 1, a picture level 110, and a macroblock 
(MB) level 109. At the start of every new GOP at the GOP level 111, a GOP bit 
allocation process 100 computes the total number of bits (R g0 p) allocated for the 

25 GOP. The value of R gop is given by: 

bit _rate x N H * 

R 8 op = ~~ 4 ~ ; 

picture _rate 

where 
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bitjrate is the target bit rate for encoding the picture 

sequence, 

picture-rate is the number of pictures coded per second, and 
N is the total number of pictures coded in the GOP. 

The remaining bits (R) for the GOP is determined at the picture level 
1 10 by a remaining bits determination process 106. The value of R is updated as: 

R = R-S where S is the number of bits used by previously 

coded pictures in the GOP, and 
R = R +R gop for a new GOP. 

With the computed value of R, a picture bit allocation process 101 at 
the picture level 110 then computes a target bit value (7) allocated to the current 
picture according to the equations below for corresponding I, P or B picture type: 



T : = max 



R 



bit rate 



N p X p N b X b K x picture _ rate 
l + ~X~K b ~ + X~K b ~ 



(2) 



T p = max 



R 



bit rate 



+ N b K p X b ' K x picture _rate 

p+ XK b 
p b 



(3) 
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T b = max 



R 



bit rate 



N p K b X p ' K x picture _rate 



b p 



(4) 



where 



Ti, T p , T b Represents target bits for the next picture (I, P, B) 
N b , N p Represents the remaining number of pictures (P, B) in 

the GOP 
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X it Xp, X b Represents global complexity measures and gives a 
measure of the actual number of bits required to 
represent the picture (without compression), and 
Xi=QiXSi 

5 Xp = Qp x Sp 

X b = Q b x S b 

S„ S p , S b are the actual number of bits used to code 
the previous l/P/B frame. 
K s a constant (er^h -e.g, 8) 

1 0 After computing the target bits for the picture, a reference 

quant i oation quantization step-size(Qy) computation 102 is carried out at the MB 
level 109 for each macroblock within the picture. The value of Q y is computed 
based on the determined target bit allocation (7}, T p , or T b ) and a buffer fullness 
value (dj). Each of the three picture types has a virtual buffer associated with it 

15 and these buffers are updated after coding each macro block of the picture type 
according to the following equations: 

d d >+ B IlAL^ (5) 
1 0 " MB_Cnt 

T B x(j-\) 

d'=d 0 p +B M -^-^ — - 6 
1 0 yl MB_Cnt 

d b i =dl+B J ,- TbX ^ j ~^ (7) 
7 0 Jl MB_Cnt 

20 where 

do is the initial virtual buffer fullness, 

dj is the virtual buffer fullness when coding f h MB, 

Bj.i is the actual bits consumed up to and including 0' - 1) th 

MB as provided by a virtual buffer update process 
25 105, and 

MB_Cnt is the number of MBs in the picture. 
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Equations(5) to (7) effectively track the differences between the 
actual number of bits used and the target bits allocated, and these differences are 
then added to the respective virtual buffers which are used to compute Qy. This 
allows the rate controller to control the bits allocation based on the consumption 
5 pattern of the picture Qy is computed from the buffer fullness via the equation: 

Qj - (dj x 31 ) / r = dj x constant (8) 
where r is a reaction parameter = 2 x bitjrate I picture jrate 

The reaction parameter, as the name implies, is a factor that can 
control the sensitivity of the algorithm from a change dj. Large r will cause the 

10 reaction to be slower, which may cause the target bits and actual bits to differ 
significantly, but it brings about a more gradual rate of change in bit consumption 
which is favourab l o favorable for scene changes. On the other hand, a small r 
causes the controller to be more reactive to changes, giving rise to a closer target 
and actual bits value, but a faster response also means a less gradual change 

15 during scene change which is not desirable. A graph illustrating the difference 
between the target bit allocation and the actual bits allocated within a frame is 
shown in Figure 2. 

The value of Qy is further modified by an activity masking process 103 
to give the modulated quantisation quantization step size, Mquant. Basically, the 

20 spatial masking ability of the macroblock is used to place a correction factor onto 
the computed Q y value. Spatial masking ability is the ability to contain noise 
masked from human visual systems. Typically, an area with complex texture will 
have a larger correction factor than an area with simple texture, hence, 

Mquant -Njactj x Q, (9) 
25 where N_actj, is the norma li so normalize value of macroblock activity level and may 
have a value of between 0.5 and 2 

With the computed Mquant, the macroblock coding process 104 is 
performed where the macro block is compressed according to the MPEG 
standards. 
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At the end of coding a picture, the bit count S used for coding the 
picture is used to perform a VBV check 107 and to compute the remaining bits R 
for the current GOP at the remaining bits determination process 106. The VBV 
(video buffer verifier) is a virtual buffer emulating the status of the decoding buffer. 
5 The VBV check(1 07) is performed to detect any overflow or underflow of the 
decoding buffer with reference to the target bit rate. 

When coding a picture sequence with varying scene complexity, a 
constant bit rate (CBR) encoder is unable to vary the bit-rate according to the 
scene complexity. This results in scenarios whereby simple scenes are allocated 
10 more bits than required while complex scenes are allocated insufficient bits, and 
therefore results in a variation of output visual quality between scenes of different 
complexities. 

An embodiment of a moving pictures encoding apparatus according 
to the present invention is illustrated in block diagram form in Figure 3. The 

15 encoding apparatus as illustrated is arranged in four levels corresponding to 
processing stages in the bit rate control process, namely a segment level 313, a 
GOP level 312, a picture level 31 1 and a macroblock (MB) level 310. An input 
moving pictures sequence is encoded in such a way that on the average, or at the 
end of encoding, the output bit rate is close to a user definable target overall bit 

20 rate {overallJBR) 320. The input sequence is divided into segments and groups of 
pictures (GOP). The purpose of a segment is for monitoring of output bit rate with 
reference to the overail_BR> and the purpose of the GOP is primarily to facilitate 
random access. A segment may be defined to include a number of GOP(s) 
depending on the need in terms of frequency of monitoring output bit rate. Each 

25 GOP comprises at least an l-picture and optionally one or more P-pictures and/or 
B-pictures. Each picture is divided into macroblocks of pixels for encoding. 
Hence, a method of rate control according to the current invention may contain 
processes at the segment level 313, GOP level 312, picture level 31 1 , and finally 
macroblock level 310. 
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With a new group of pictures, a GOP bit allocation processor 300 
computes the number of bits allocated to the current GOP(R r gop ) as follows: 

R'=BRx ^ (10) 

picture _rate 

where: 

5 Ng 0 p is the number of pictures in the GOP, 

picturejrate is the number of pictures coded per second, and 
BR is a target bit rate. 

The target bit rate BR is determined for each picture or plurality of 
pictures by a bit rate adjustment processor 306 which is described hereinbelow. A 
10 remaining number of bits (R') for the GOP is determined at the picture level 31 1 by 
a remaining bits determination processor 308, which carries out the steps of: 

(a) before encoding the first picture in a GOP, adjusting f?' with a 

new R 'gop: 

R + = R gop, and then set BR 0 id, = BR 

1 5 (b) otherwise if the picture to be coded is not the first picture of a 

GOP, and a new target bit rate BR is determined, then adjusting FT with the new 
BR according to: 

R'+ = x (BR - BR old I and then set BR old = BR 

picture _rate 

where N is remaining number of pictures in the current GOP 
20 (c) and removing number of bits used by the last coded picture S 

from the remaining bits value R', hence: 

f?' = K'-S 

The computed R' is then passed to a picture bit allocation processor 
301 to compute the target bit (7) allocated to a current picture to be coded. For 
25 example, equations (2), (3) and (4) described hereinabove may be used for that 
purpose, with the parameters K p , K D , A/ p , N b and pictjtype being supplied. Note 
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that the lower limit of bitjrate (&*picture_rate) is only optional and may be adjusted 
if necessary. 

At the macroblock (MB) level 310, a reference 
quant iG at i on quantization step-size computation processor 302 computes a 
reference q ua ntisatio n q uantization step-size Q, for each MB using the computed 
target bit allocation 7 and a bit uti li sation utilization parameter B determined by a 
virtual buffer update processor 305. An example of a method for computing Q, for 
the/ 7 MB of the current picture is represented by: 

K x xDj x picture _rate 



1 0 where 



where 



Qj = 



K 2 xBR 



(11) 



Ki and K 2 
BR 



are constants (efl ^e.c?., 31 and 2 respectively), 
is the determined target bit rate, 
picture_rate is the number of coded pictures per second, and 
Dj is a virtual buffer fullness 

The virtual buffer fullness of the f h MB is determined by: 

K x x Dj x picture _ rate 



K 2 xBR 



(12) 



Do 
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is an initial buffer fullness before coding the current 
picture, ie w'.e.. Dj at end of the previous picture, 
is the bit ut i l i sation utilization information supplied by 
the virtual buffer update processor 305, ie W.e.. bits 
used to code the 1 st MB to (j-k 3 f MB, 
is the total number of MB in the current picture, and 
is a constant (eeh -e.g. 1 ). 

Three independent current and initial virtual buffers fullness values 
(Dj , D* , D* , £>q , £> 0 P , D* ) may be maintained for the three picture types (I- 
pictures, P-pictures, and B-pictures). The computed Qy may also be further scaled 
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MB_Cnt 
K 3 



by an activity masking processor 303 according to a masking factor determined by 
the surrounding activity levels of the MB to form the final quant i Gation quantization 
step-size, Mquant, for coding the current MB. A method of masking factor 
determination can be found in the aforementioned MPEG-2TM5. 
5 Macroblock coding (304) is then performed to encode the current 

MB. A macroblock coding processor 304 may employ, for example, methods 
according to the MPEG-1 or MPEG-2 video encoding standards. Such encoding 
methods include necessary motion compensation, discrete cosine transform, 
quant i Gation quantization with the determined quantiGat i on quantization step-size, 

10 and run-length encoding with variable length codes. The methods also include 
necessary decoding processes such that motion compensation can be performed. 
The number of bits uti li sed utilized by the macroblock coding processor 304 to code 
each MB is passed to the virtual buffer update processor 305. 

In applications where a maximum and/or minimum bit rate must be 

15 maintained for encoding, a VBV checking processor 307 is trttfised utilized . User 
definable maximum and/or minimum bit rates (max/min BR) are input at 321 to the 
VBV checker 307, a method of VBV checking according to the MPEG standards is 
used to examine bit ut i lisation utilization information from the virtual buffer 305, and 
necessary corrections are made to ensure compliance and that the output bit rate 

20 is within the defined max/min BR. 

At the end of coding a picture or a plurality of pictures, bit rate 
adjustment is applied by the bit rate adjustment processor 306 to provide any 
necessary correction to the current target bit rate BR. The correction is based on a 
target encoding quality provided by a target segment quality adjustment processor 

25 309 and a resultant encoding quality of previously coded picture(s) such that the 
overall encoding qualities of pictures within a segment are relatively close to the 
target constant. The resultant encoding quality may be determined by the average 
value of the reference quant i sation quantization step size (je7- /.e., average Qj) of 
the previously coded picture(s). A method of target bit rate (BR) determination can 

30 be derived from a rate quant i Gation quantization model as given by: 
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K. x (average Q, -target Q. ) 

BR = current _BR + — — y/ 

average _Qj 



(13) 



where 



current_BR is a current estimated bit rate, or 
current BR = S,. + N p x S p +N b x S b , 
5 Si, S p , S b are bits used by previously coded I, P, and B-pictures 

respectively, 

A/ p , N b are total number of P and B-pictures in the current 

GOP, 

average_Qj is the average value of Q, of previously coded 
10 picture(s) 

target_Qj is the target encoding quality or target value of Q,, 
K 4 may be a constant, or a factor of previous BR, 

max_BR, or current_BR, and K, may also be 
separately determined for I, P, and B-picture types. 

15 Maximum and/or minimum bit rates can be applied to the determined 

BR according to application requirements. The max/min BR input at 322 are used 
as given by: 

if (BR > max_BR), then BR = max_BR 
if (BR > min_BR), then BR = min_BR 

20 It is also possible to make use of the target__Qj at the macro-block 

level to increase efficiency of encoding. Basically, the target_Qj is used by the 
reference quant i sation quantization step-size computation processor 302 as a 
lower limit of the output reference quantiGation quantization step-size Qj such that 
when this target (quality) is reached at the MB level, bits are saved for future 

25 encoding immediately. In this case, the output reference quant i Gat i on quantization 
step size may be set according to: 

if (Qj< target _Qj), then Q. = target _Q r 
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The target bit rate BR is adjusted continuously with reference to a 
fixed target encoding quality within each segment. In turn at the beginning of each 
segment, the target encoding quality is adjusted so that a target overall bit rate is 
achieved. A segment may contain a few groups of pictures depending on the need 
5 in terms of frequency of monitoring output bit rate. A new segment may also be 
defined by a scene change detectable by any conventional scene change detector 
with a given range of time. 

At the onset of a new segment, the target segment quality 
adjustment processor 309 executes a check on the overall bit-rate based on bits 
10 usage and makes any necessary adjustments to the target encoding quality for the 
bit rate adjustment processor 306 so that the overall bit-rate converges to a user 
definable target overall bit-rate 320 (overall_BR). 

An example embodiment of a constant overall bit rate controller 
implementing the target segment quality adjustment process according to the 
15 present invention is illustrated in block diagram form in Figure 4. The bit count 
difference between the actual and target bit counts {bits_diff) for coding of a 
previous segment is calculated by a bits difference computation processor 401 , 
ut i lis i ng utilizing the steps of: 

a) obtaining the actual bits (bits_segment) used for coding the 

20 previous segment according to the input bit ut ilis at i on utilization of pictures (S) 400 
in the previous segment, and 

b) computing the value of bits_diff based on a corresponding target 
segment bit rate 406 (segment_BR) for the previous segment, according to: 

» . se gment _BRxN segment 

bits _diff bits ^segment (14) 

picture _rate 

25 where N seg ment is the number of coded pictures in the previous segment. 

The bit count difference(Ms_d//iO is then redistributed by a bit 
difference distribution processor 402 over the next k number of segments using a 
bit distribution function, f(m), expressed as: 
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delta _bits m = f(m) x bits _diff 

where 

m = \ f ...,k, 

5 delta_bits m is the bits difference distributed to next m th segment. 

For an example case of k = 4 and f(m) = Mk, Figure 5 illustrates how 
the bit count difference (bitsjdiff) of an encoded segment is distributed over the 
next 4 segments. The period of k segments used for bits compensation is referred 
to as an adjustment interval. A delta bit-rate computation processor 403 (Figure 4) 

1 0 accumulates all bit differences {delta_bits) distributed from previously encoded 
segments to the current segment to be coded, and computes a delta segment bit- 
rate (&segment_BR) based on the accumulated delta_bits and the number of 
pictures in the current segment to be coded. The delta segment bit-rate and a 
user definable target overall bit-rate (overall_BR) input at 408 are used by the 

1 5 target segment BR adjustment processor 405 to derive the target segment bit rate 
(segment_BR) 406 for the current segment. The value of segment_BR can be 
expressed as: 

segmenet _ BR = overall _ BR h- Asegment BR (16) 

A new target encoding quality (target_Qj) 407 for encoding of the 
20 current segment is then determined from the target segment bit rate using a BR-Qj 
modo l l i na modelina processor 404. The BR-Qj modc l lina modelina processor may 
operate, for example, according to: 

An [segmenet BR -segment BR') — 

AQj ^target _Q) x v * ~ f— = 1 (17) 

' 1 K 5 x segment BR 



25 where 



t arg et _Qj=t org et _ Q) + AQj 

target_Q'j is the target Q, of the previous segment, 
segmentJBR' is the target segment bit rate of the previous 
segments, and 
20 



K 5 is a constant which can be experimentally 

determined. 

A maximum limit may be placed on the determined AO, so that there 
is no drastic change in output quality from one segment to another. 
5 A flow chart for a process of variable bit rate (VBR) encoding with 

constant overall bit rate control according to an embodiment of the invention is 
illustrated in Figure 6. I nitialisation lnitialization of predetermined parameters is first 
carried out at step 600. Then, at step 601 the process determines whether 
processing is at the start of a new group-of-pictures (GOP) or not. Where a new 

10 GOP is determined, step 602 is carried out by updating the bit allocation for the 
new GOP. This may involve, for example, computation of the GOP bit allocation 
R'gop according to Equation (10) described hereinabove and accumulating to the 
remaining bit value R' to give the R' for the new GOP. 

For each picture in the current GOP, a target bit allocation value T is 

15 determined at step 603, and this may be computed according to Equations (2), (3) 
and (4), for example. Then, for each macroblock in the picture, a reference 
quant i sat i on quantization step size Q is determined using the computed target bit 
allocation 7, such as by the process represented by Equations (1 1 ) and (12). 
Activity masking may also be included in this step. The macroblock of the picture 

20 is then encoded at step 605 using the computed quantioation quantization step 
size, and steps 604 and 605 are repeated for all of the macroblocks in the picture 
until the end of the picture is determined at step 606. 

When the end of the sequence of moving pictures which is being 
encoded is reached, this is determined at step 607, which terminates the process 

25 at step 61 1 upon that occurrence. Otherwise, the process continues to step 608 
where it is determined whether the end of the current segment of pictures has 
been reached. 

When the end of a segment of pictures is reached and processing of 
a new segment about to begin, the target encoding quality is adjusted at step 609. 
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This is performed by firstly computing the difference between the number of bits 
allocated for coding the previous segment and the actual number of bits used in 
coding that segment, such as by the process represented by Equation (14). This 
quantity, bits_diff, represents extra bits which are left over from the previous 
5 segment of pictures, and may be distributed for use in encoding the ensuing 
picture segments. The left over bits bits_diff are distributed for use over one or 
more segments according to a bit distribution function, an example of which is 
described in connection with Equation (15). In essence, the left over bits from the 
previous segment are divided into a plurality of k, preferably equal, amounts and 

1 0 allocated to the following k segments for encoding with. The dividends allocated 
from all of the k previously processed segments are accumulated and used to 
determine a change in segment bit rate, &segment_BR, according to the 
accumulated bits difference and the number of pictures in the segment to be 
coded. This can then be used to compute an allocated bit rate for the segment to 

1 5 be coded according to Equation (16).Finally, a target encoding quality, target_Qj, 
can be computed according to Equations (17) and (18), for example. 

If the end of a segment has not been reached at step 608, or after 
the target quality adjustment of step 609, the target quality and the actual quality of 
encoded pictures is compared, and a new bit rate for encoding the next picture is 

20 computed based on the difference. This step can be carried out as described in 
relation to Equation (13), for example. The procedure then returns to step 601 , 
described above. 

In summary, embodiments of the present invention provide methods 
and apparatus for encoding moving pictures with a variable bit rate whilst 

25 maintaining a consistent output encoding quality according to a determined target 
quality. The target quality is adjusted at the beginning of every moving pictures 
segment according to a bit distribution function to ensure an overall bit rate that 
meets a defined target. Furthermore, the segment based bit distribution function 
provides smooth and flexible modulation of the encoding quality. The method can 
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be implemented at low cost and with the ability to encode an input moving pictures 
sequence in one-pass at real-time. 

It will be readily apparent to those of ordinary skill in the art that the 
foregoing detailed description of the present invention has been presented by way 
of example only, and is not intended to be considered limiting to the invention as 
defined in the claims appended hereto. In particular, it is envisaged that numerous 
variations to the embodiments as described can be made without departing from 
the spirit and scope of the invention. 

Throughout this specification and the claims which follow, unless the 
context requires otherwise, the word "comprise", and variations such as 
"comprises" and "comprising", will be understood to imply the inclusion of a stated 
integer or step or group of integers or steps but not the exclusion of any other 
integer or step or group of integers or steps. 
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GLOSSARY 
video buffer verifier 
variable bit rate 
constant bit rate 
Moving Picture Experts Group 
discrete cosine transform 
group of pictures 
macroblock 

quantisation parameter (picture quality) 
quantisation parameter for macroblock j 
quantisation parameters for I, B and P type pictures 
number of bits allocated for encoding a group of pictures 
target bit rate for a picture sequence 
number of pictures coded per second 
total number of pictures in the group of pictures 
remaining number of P, B pictures in the group of pictures 
remaining number of bits for encoding a group of pictures 
number of bits already used in coding a group of pictures 
target number of bits for coding next I, P, B type picture 
global complexity measures for I, P, B type pictured 
virtual buffer fullness when coding the j th macroblock 
initial virtual buffer fullness 

actual bits used up to and including(/-1) th macroblock 

number of macroblocks in the picture 

reaction parameter (= 2 x bitjrate / picture jrate) 

normalised macroblock activity level 

modulated quantisation step size (= N_actj x Qy) 

definable target overall bit rate 

target bit rate 
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